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1. 



Field of the Invention 



The invention relates generally to processes for configuring and installing 
products in a data center or warehouse environment. 

10 2. Background 

Companies and other large entities increasingly rely on distributed computing 
where many user terminals connect to one or more servers that are centrally located. 
These locations called "data centers" may be facilities owned by the company or may be 
supplied by a third-party. These data centers house not only computers, but may also 

1 5 have persistent connections to the Internet and thus, conveniently house networking 

equipment such as switches and routers. Web servers and other servers that need to be 
network accessible are often housed in data centers. Where a third-party owns the data 
center, the entity in question rents a "cage" or enclosure that has racks upon which 
assembled/standalone units, such as computers and routers, can be installed. The entity 

20 may also simply lease the units that are rack-mountable from the third-party. In any case, 
the data center is usually divided into a number of predefined areas, including a 
shipping/docking area, assembly area, and area where enclosures and their constituent 
racks are kept. 



25 networking systems involves a series of independent stages. First, based on determined 
requirements, components of the systems are ordered through a vendor or supplier. Once 
the components for these systems are received, inventory logs the "asset" tag for the 
component which identifies it for future reconciliation/audits. While the order for the 
components themselves may identify a number of attributes that each component should 



Typically, the business process of installing and configuring new computer or 
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have (i.e. amount of memory, number of ports, model number etc.), the inventory systems 
often do not, and may only be concerned with the fact that the item was in fact received, 
and what the serial number or other distinguishing identifier is. Conventional asset 
records track accounting information such as depreciation, but not other attribute 
5 information. 

Once a component or set of components is received it is installed in the data 
center. Installation and assembly of components that make up a deployable "asset" is not 
typically performed by those employed in the receiving/warehousing department or by 
those who track inventory. After the component is physically assembled or installed, it 

10 will need to attain a "soft" configuration. The soft configuration includes attributes such 
as the IP (Internet Protocol) address, operating environment and so on. This soft 
configuration information frequently depends upon the attributes of the component. For 
instance, when installing software applications on a computing system asset ("compute 
node"), the operating system image to be deployed may depend on the size of the disk in 

15 the asset. Similarly, the MAC (Media Access Control) address of the network interface 
card may be needed to give the asset a correct IP address. The current environment relies 
on highly skilled employees for all aspects of component assembly and configuration. 
Because such skilled workers are in short supply, the assembly and configuration of new 
components in a data center can take weeks. 

20 The management system is the vehicle and charge of the administrative or 

Information Technology (IT) departments within a large entity such as a corporation. 
The management system must identify, once products are received, what they consist of, 
and how to configure or install them. This information must be either discovered by the 
management system or re-entered into the management system by the skilled workers 

25 who configure and install the component. As is often the case, the skilled assembler must 
take the received components and inspect/test them to find out its attributes and 
configuration because the original order data and the received physical component cannot 
be easily correlated. 

There is thus needed a more efficient configuration process that requires less use 
30 of skilled workers and increases the reliability of the configuration job and time-to- 
deployment of components. 
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SUMMARY 

What is disclosed is a method comprising 1) discovering a new unit deployed 
within a data center; 2) finding a configuration template for the discovered unit; and 3) 
automatically installing software on the discovered unit based upon the configuration 
5 template. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a flowchart of the primary methodology in mapping an inventory 
management system to a configuration management system according to one or more 
embodiments of the invention. 

5 Figure 2 is a flowchart illustrating new unit discovery according to one or more 

embodiments of the invention. 

Figure 3 is a flowchart illustrating associating of a node's configuration with the 
management system according to one or more embodiments of the invention. 

Figure 4 is a diagram illustrating the interaction of the systems involved in 
1 0 implementing the various embodiments of the invention. 

Figure 5 is a diagram of a compute node which can be configured and managed in 
accordance with the various embodiments of the invention. 

Figure 6 is a diagram of a computer implementation of one or more embodiments 
of the invention. 

15 



-4- 



PDNO 10007990 




DETAILED DESCRIPTION 

Referring to the figures, exemplary embodiments of the invention will now be 
described. The exemplary embodiments are provided to illustrate aspects of the invention 
and should not be construed as limiting the scope of the invention. The exemplary 
5 embodiments are primarily described with reference to block diagrams or flowcharts. As 
to the flowcharts, each block within the flowcharts represents both a method step and an 
apparatus element for performing the method step. Depending upon the implementation, 
the corresponding apparatus element may be configured in hardware, software, firmware 
or combinations thereof. 

10 The invention primarily consists of utilizing a management system to control the 

configuration and installation of software on a compute node. The management system 
maintains a database of asset records, and for each node, when the node is first requested 
or ordered, it creates an asset record and asset ID unique to that asset. The asset record is 
associated with the node based upon a certain parameter such the MAC address of the 

1 5 node's NIC. Once a node is deployed it sends out a network request. Based on this 
request, the management system proceeds with a new unit discovery process. The 
management system then finds a configuration template suitable for the node. Finally, 
using the configuration template, software is automatically installed on the node. 

Figure 1 is a flowchart of the primary methodology in mapping an inventory 
20 management system to a configuration management system according to one or more 

embodiments of the invention. First, the inventory or ordering system will build a request 
for units to be deployed in a rack (block 1 10). For instance, if it were determined that a 
computer system needs to be deployed in a given rack, a request for that system is built. 
This type of request typically accompanies an order to a vendor for the components of the 
25 unit. However, the unit can also be built based on components already in inventory. 
Thus, according to block 120, there is check as to whether the units (and their 
components) are in inventory. If the units are not in inventory, the management system 
must wait until the units are in inventory and ready for deployment (block 130). Once the 
units are in inventory, they are installed in the racks and powered-on (block 140). 
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At this point, the node has been bolted into a rack, has been plugged to power and 
networking and has been powered on. By using network messaging (described in detail 
with respect to Figure 2), the new unit will undergo a discovery process (block 150). In 
the new unit discovery, the unit will broadcast a message on the network requesting the 
5 management system to provide it with configuration data. The management system uses 
the information provided by the unit to find a configuration template for the discovered 
unit (block 160). The configuration templates are a series of configuration parameters 
and instructions that are stored/created for different classes or types of units. Depending 
upon the type, model or class of the unit, the management system or other specialized 
10 system (e.g., see software configuration system, described below) will find an appropriate 
configuration template (block 160). 

Once a configuration template is found, the management system or other 
specialized system (e.g., see software configuration system, described below) will install 
software on the unit based on the parameters given by the template (block 170). 

15 Alternatively, the management system may provide the unit with instructions on how to 
install this software. This automatic installation of software is made possible in a data 
center environment partially because the management system database contains 
information about the attributes (such as the MAC address of the network interface card 
(NIC) in the unit). Once the software is installed, the unit can signal to the management 

20 system that it is ready for use (block 180). 

Figure 2 is a flowchart illustrating new unit discovery according to one or more 
embodiments of the invention. At this point the node has been bolted into a rack, an asset 
record (described in detail with respect to Figure 3) has been created, it has been plugged 
to power and networking and it has been powered on. The new unit discovery begins by 

25 checking if the node (unit as installed in the rack) requires soft configuration (block 210). 
An example of such a node is a "compute" node. A compute node is a unit that has large- 
scale data processing (computing) capability such as a personal computer system. Such 
nodes are often characteristic of servers and will often have one or more NICs (Network 
Interface Cards) which allow the node to communicate information on a network. The 

30 primary NIC will send out a network request (e.g. DHCP (Dynamic Host Control 

Protocol) request for an IP address) (block 220) which may also be accompanied by an 
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explicit request for configuration data. This signals the management infrastructure that a 
node is booting up and is ready to be configured. 

The MAC (Media Access Control) address of the NIC is a device signature unique 
to the NIC. The MAC uniquely identifies the NIC to the management system. MAC 
5 addresses are assigned at the time of manufacture and are guaranteed to be globally 
unique. All network messages sent by the NIC contain its MAC address to allow other 
nodes to communicate back to it. When a primary NIC sends out a network request 
message, the management system will compare the MAC sent by the node with all the 
MACs that are known (block 230). The known MACs will be those of devices that are in 

10 inventory or have been received by the company and thus, are present in the management 
system database. If the MAC is not known, then one possible explanation is that an 
intruder has penetrated the network. Thus, in this case of an unknown MAC, the 
management system will begin intruder diagnostics (block 235). Each node with network 
access in a data center must connect to a known good switch, determining the switch of 

1 5 origin will allow the management infrastructure to determine the location of the intruder. 
All unknown MACs are assumed to be intruders until verification is complete and the 
management infrastructure is updated. 

If the MAC is known, then using the MAC as a key (or indexing parameter) the 
asset ID of the node is found (block 240). The next test is to see whether the state 

20 information (associated by and stored along with the asset ID) for the node indicates that 
the node is in the initial state (block 250). The initial state is when the node is first 
installed in a rack. If it is not in the initial state, then a further check is performed to see 
whether the node's state information indicates that it is in a reinstall state (block 260). If 
the node is neither in reinstall nor initial states, then it indicates that the node is 

25 undergoing a reboot. In this case, the node is allowed to proceed with its normal boot 

process (block 270). If the node is either in reinstall state (checked at block 260) or in the 
initial state (checked at block 250), then software needs to be installed. When in a 
reinstall state, the node is configured in a like manner to the initial state with the 
exception that a node needs to be scrubbed (i.e. have its hard drive erased). Hence, to 

30 determine which software to install and the parameters thereof, the management system 
finds an appropriate configuration template for the discovered unit (block 280). 



-7- 



PDNO 10007990 




Figure 3 is a flowchart illustrating associating of a node's configuration with the 
management system according to one or more embodiments of the invention. First, the 
configuration template for a compute node (unit with computing capability) is defined (if 
it does not yet exist) or retrieved (if already present in the system) (block 310). This 
5 includes all optional (e.g. additional NICs, management cards) and configuration 

specifications (e.g. processor speed) for the node allowed by the manufacturer. Next, an 
asset record is created in the management system database with a specific and unique 
asset ID for the node (block 320). The asset record will track the configuration 
information (or pointers to the appropriate configuration template), soft configuration, 
10 state, asset ID, MAC and other pertinent information about the node. Each node has its 
own asset ID and asset record, which are all in one-to-one relationships with another. 
Once the asset record is created, all activities related to the node (which may or may not 
physically yet exist) can be tracked. After the asset record is created, the node is ordered 
or requested (block 330). As detailed information becomes available about the asset, it is 

15 entered in the asset record during each step of its purchase, assembly and installation. For 
example, the kind of processor in the asset or the amount of internal disk can be entered 
when the asset is ordered because that information is known when the purchase order is 
written. The ordering and receipt of the node can also be tracked within the created asset 
record. The management system can check to see if the node is received from the 

20 manufacturer after it has been ordered (block 340). If the node is not yet received, the 

management system must wait for receipt of the ordered node (block 350). If the node is 
received from the manufacturer (or vendor), then the assembly of the components into the 
requested node can be prepared for (for instance, if it has multiple components that need 
to be integrated together) (block 360). As part of this process, the bar-code information 

25 on the components is read and then the data therefrom is associated with the previously 
created asset record (block 370). Additionally, information about the MAC addresses of 
the NIC cards is recorded in the asset record. This allows the management system to find 
the soft configuration template associated with the node during the discovery process. 

Next, the node is associated with the order's corresponding asset record (block 
30 380). This allows the management system to associate other attributes of the node (e.g., 
processor type, amount of memory or internal disk) with the MAC address. The 
management system then waits for the node to be deployed in a rack on the data center 
floor (block 390). At this point the asset ID for the specific node has been associated with 
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all MACs that will be accessing the network from that node. The asset record contains 
the configuration information (or a pointer to the configuration template) so that the 
process of installing and configuring software on the newly deployed node can be 
automatically carried out by the management system (or other dedicated system such as a 
5 software configuration system, detailed below) when it requests configuration 
information over the network as it is powered up. 

Figure 4 is a diagram illustrating the interaction of the systems involved in 
implementing the various embodiments of the invention. At the data center, an internal 
LAN (Local Area Network) Mechanism 430 is used for network communications. LAN 

10 mechanism 430 may consist of mechanisms such as Ethernet for carrying LAN 

information traffic and may include protocols for interaction between users of the LAN, 
such as TCP/IP or IPX. The LAN mechanism 430 ties together various servers, devices, 
nodes and rack locations of the data center. A new compute node 400 may be deployed 
within a given rack and may contain one or more NICs that allow it to communicate over 

1 5 LAN mechanism 430. A first primary NIC of new compute node 400 will connect the 
new compute node 400 to a primary switch 410 which may also be deployed in the same 
rack. The primary switch 410 is a part of the LAN mechanism 430 and connects the 
primary NIC to the LAN mechanism 430. The new compute node 400 may optionally 
have a secondary NIC which will connect it to a secondary switch 420. The secondary 

20 switch 420 may also connect the secondary NIC to the LAN mechanism 430. Alternately 
the secondary switch 420 may connect the secondary NIC to a different LAN mechanism 
or network. 

LAN mechanism 430 allows other systems such a software configuration system 
440 and a management system 450 to be connected to each other and to new compute 

25 node 400. The software configuration system 440 serves applications and performs 
installs of applications to nodes. The management system 450 has database server 
software, which manages asset records that can be stored in a datastore 460 (e.g., a 
database). During new unit discovery, the management system 450 responds to a 
network request from the new compute node 400, once deployed in its rack. The 

30 management system 450 then compares the MAC of the primary NIC of compute node 
400 with a list of MACs for known devices which may be stored in datastore 460. If 
known, the management system 450 finds the appropriate asset ID (and, consequently, 
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asset record) associated with the node 400. It then sends a message to compute node 400 
with pointers (contained in the asset record) to the correct software in the software 
configuration system 440. In one embodiment of the invention, the software 
configuration system may be a tftp (Trivial File Transfer Protocol) server. The compute 
5 node then requests the software configuration system for the software and loads it. 
Depending on the configuration, the node may also request other software from the 
software configuration system, or alternatively, the software configuration system may 
install other software on node 400. 

The management system 450 is also responsible for tracking and maintaining state 
10 information regarding the new compute node 400. This state information can be stored in 
datastore 460 in an asset record corresponding to the new compute node 400. If the 
management system 450 determines, for instance, that the new compute node 400 is in an 
initial state, it will initiate software configuration system 440. The management system 
450 will find a configuration template that corresponds to the asset class/type of the new 
15 compute node 400 which would be designated in its asset record. The configuration 
template that is found will then form the basis by which the software configuration 
system 440 decides how and what software will be installed onto new compute node 400. 
The software configuration system 440 then installs, automatically, the desired software 
onto the new compute node 400. 

20 The management system 450 also initially creates the asset record at the time the 

new compute node 400 is requested or ordered, and maintains in that asset record any 
post-deployment information that would be desirable for further installation, monitoring 
or maintenance of the new compute node 400. The software configuration system 440 
will contain installable versions of the software that is to be installed on nodes and 

25 application software that controls the installation process. 

Figure 5 is a diagram of a compute node which can be configured and managed in 
accordance with the various embodiments of the invention. The compute node 500 has a 
number of components such as a CPU (Central Processing Unit) 510 and RAM (Random 
Access Memory) 520. The compute node 500 also has a bus 580 that allows these 
30 components and others to communicate with each other. For instance, compute node 500 
is shown having two NICs, a primary NIC 540 (so called because it is in the primary slot) 
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and a secondary NIC 550. Each of these NICs are connected to other components within 
the node and to a LAN (Local Area Network) 590. LAN 590 is shown merely as an 
example of the possible networks that the NICs may connect to. Each of NICs 540 and 
550 may instead connect to separate networks. For instance, the primary NIC 540 may be 
5 connected to LAN 590 while the secondary NIC 550 is connected to a WAN (Wide Area 
Network) such as the Internet. Bus 580 also connects other peripheral components such 
as a disk 530, which is non-volatile storage mechanism such as a hard drive. 

In accordance with the invention, the compute node 500 may be assembled of the 
components — such as CPU 510, RAM 520, disk 530, primary NIC 540 and secondary 

10 NIC 550. Prior to assembly, the bar-code information for these components may be 

scanned and used to create asset record. When finally deployed, the compute node 500 
will send a network request message through either NIC 540 or NIC 550. The 
management system will located the correct soft configuration information for the node 
using the MAC address of the NIC that sent the request. Next, the management system 

15 and software configuration system will install applications onto disk 530 of node 500 
through one or both of the two NICs 540 and/or 550. If the MAC address of the NIC is 
not known to the management system, the management system may flag the request as a 
possible intrusion, and start appropriate security measures. Once these applications, such 
as operating system software, are configured on the node 500, it is then completely 

20 deployed as an operational part of its rack and of the data center in which its rack is 
housed. The CPU 510, RAM 520 and/or disk 530 may be of such a type, speed and 
capacity that would warrant installing only certain software or only certain optimized or 
un-optimized versions of the same software. The management system would be able to 
determine such parameters of the install based upon the asset information about the node 

25 500 that is contained in its asset record. 

When the compute node 500 boots, the components attached to the internal bus 
580 become active in a specific order. Ordinarily, the primary NIC 540 being in the 
primary slot becomes active and can communicate with the LAN 590 before the compute 
node 500 is fully booted. This allows for the primary NIC 540 to act as a gateway for a 
30 new soft configuration for the node 500 to be done (soft configuration includes network 
identity, operating system, applications, etc.). 
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Figure 6 is a diagram of a computer implementation of one or more embodiments 
of the invention. Illustrated is a computer system 607, which may be any general or 
special purpose computing or data processing machine such as a PC (personal computer), 
coupled to a network 600. One of ordinary skill in the art may program computer system 
5 607 to act as a management system server and/or a software configuration system server. 
The management system server and software configuration system server, are, in 
accordance with some embodiments of the invention, two separate and independently 
operating systems. However, it will be readily apparent that the functionality of both the 
management system and the software configuration system can be integrated onto as 
10 services of a single physical computer system such as system 607. According to one or 
more embodiments of the invention, the system 607 or systems similar to it, would be 
programmed to perform the following functions when implementing a management 
server: 



Building an asset record for an ordered/requested node; 



15 



Receiving a network request from a deployed node; 



Comparing the MAC associated with received network requests with known 
MACs; 



Interacting, managing and maintaining a database of asset records; 



Determining, maintaining and updating state information regarding nodes; and 



20 



Finding a software configuration template that corresponds to a node needing 



software installation. 



According to one or more embodiments of the invention, the system 607 or 
systems similar to it, would be programmed to perform the following functions when 
implemented as a software configuration system server: 



25 



Reading parameters contained in a software configuration template; 



Installing software applications on nodes needing such installation; 
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• Reinitializing non-volatile storage mechanisms in nodes already having 
installed software but desiring a re-install; 

Configuring said software applications during and after installation; and 

• Upgrading or reconfiguring installed software applications on nodes when so 
5 desired. 

In either role, system 607 has a processor 612 and a memory 611, such as RAM, 
which is used to store/load instructions, addresses and result data as desired. The 
implementation of the above functionality in software may derive from an executable or 
set of executables compiled from source code written in a language such as C++. The 

10 instructions of those executable(s), may be stored to a disk 618, such as a hard drive, or 
memory 611. After accessing them from storage, the software executables may then be 
loaded into memory 61 1 and its instructions executed by processor 612. The result of 
such methods may include calls and directives in the case that the asset records (and 
related information such as software configuration templates) are stored on disk 618, or a 

15 simple transfer of native instructions to the asset records database via network 600 if it is 
stored remotely. The asset records base may be stored on disk 61 8, as mentioned, or 
stored remotely and accessed over network 600 by system 607. Also, installable versions 
of software applications that are to be installed on deployed nodes may be stored on disk 
618, as mentioned, or stored remotely and accessed over network 600 by system 607. 

20 Computer system 607 has a system bus 613 which facilitates information transfer 

to/from the processor 612 and memory 61 1 and a bridge 614 which couples to an I/O bus 

615. I/O bus 615 connects various I/O devices such as a network interface card (NIC) 

616, disk 618 and to the system memory 61 1 and processor 612. The NIC 616 allows 
software, such as server software, executing within computer system 607 to transact data, 

25 such as requests for network addressing or software installation, to nodes or other servers 
connected to network 600. Network 600 is also connected to the data center or passes 
through the data center, so that sections thereof, such as deployed nodes placed in racks 
and management and software configuration systems, can communicate with system 607. 

The exemplary embodiments described herein are provided merely to illustrate the 
30 principles of the invention and should not be construed as limiting the scope of the 
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invention. Rather, the principles of the invention may be applied to a wide range of 
systems to achieve the advantages described herein and to achieve other advantages or to 
satisfy other objectives as well. 
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